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STREAMING SIMD EXTENSIONS 2 INSTRUCTION SET INTEL CORPORATION 

SHUFPD — Shuffle Double-Precision Floating-Point 



Instruction Description 

SHUFPD xmml, xmm2/ml2S, immS Shuffle packed double-precision floating-point numbers. 



Description 

Shuffles cither of the two packed double-precision floating-point numbers from xmml to the 
low quad word of xmml; shuffles either of the two packed double-precision floating-point 
numbers from xmm2/rn128 to the high quadword of xmml. Bit 0 of the immediate field selects 
which of the two input double-precision floating-point numbers will be put m the low quadword 
of the result; bit 1 selects which of the iwo input double-precision floating-point numbers will 
be put in the high quadword of the result. 



X2 


XI 


xmml Vs. 


J Y2 


\ \ Y1 


xmm2/m 1 28 




xmml ^^""""^ ^ 




{Y2 ... Yl} 


{X2 .XI) 



Operation 

fp..setect = (imm8 » 0) & 0x1; 
xmm1[65-0] = (fp^select =- 0) ? xmmt 163-0] : 
(fp_select == 1) ? xmm1[127-64J; 
fp_seiect = (imm8 » 1) & 0x1 ; 

xrrunl [127-64] = (fp_select = 0) ? xrnm2/m128[63-<)] : 
(fp .select = 1) ? xmm2/m128[1 27-64]; 

Protected Mode Exceptions 

#GP(0) For an illegal memory operand effective address in the CS, DS, ES, FS or 

GS segments. 

If memory operand is not aligned on a 16-byte boundary, regardless of 
segment. 

#SS(0) For an illegal address in the SS segment. 

#PF(fault-codc) For a page fault 



>E2 
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#NM Tf TS bit in CRO is set. 

#XM F«r an unmasked Streaming STMD Extensions 2 instructions numeric 

exception (CR4 .OSXM MEXCPT =1). 
#UD For an unmasked Streaming SIMD Extensions 2 instructions numeric 

exception (CR4.0SXMMEXCPT =0). 

IfCRQ.EM^l, 

If CR4.0SFXSR(bU 9) = 0. 

If CPU1D.WNI(EDX bit 26) - 0. 

Real-Address Mode Exceptions 

#GP(0) Tf memory operand is not aligned on a 16-byte boundary, regardless of 

segment. 

Interrupt 13 If any part of the operand lies outside the effective address space from 0 

toOFFFFH. 

#NM If TS bit in CRO is set 

#XM For an unmasked Streaming STMD Extensions 2 instructions mimetic 

exception (CR4,OSXMMEXCPT =1). 
#UD for an unmasked Strcarning SIMD Extensions 2 instructions numeric 

exception (CR4.0SXMMEXCFT =0). 

IfCR0.EM=L 

if CR4.0SFXSR {bit 9) = 0. 

If CPUID.WN(F-DXbit 26) = 0. 

Virtual-8086 Mode Exceptions 

Same exceptions as in Real Address Mode 
#PF(fault-code) For a page fault > 

Numeric Exceptions 

None. 
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STREAMING SIMO EXTENSIONS 2 INSTRUCTION SET INTEL CORPORATION 

PSHUFD— Packed Shuffle Doubleword 



Instruction Description 

PSHUFD xmml, xmm2/m128, imm8 Shuffle the tfwbJewonJs in xmm2/m$m1 28 based on the 
encoding in imm8 and store result in xmml . 



Description 

Shuffles the four doublewords in xmm2/Tnem 1 28 in the order selected by imm8 and stores the 
result in xmral. Bits 1 and 0 of imm8 encode the source for destination doubleword 0 
(xmm 1 [3 1-0]), bits 3 and 2 encode for doubleword 1 . bits 5 and 4 encode for doubleword 2, and 
bits 7 and 6 encode for doubleword 3 (xmm 1 [ 1 27-96])- Similarly, the two bit encoding repre- 
sents which source doubleword is to be used, e.g., an binary encoding of 10 indicates that source 
doubleword 2 (xmm2/meml28[95-64J) will be used. 

Operation 

xmm1[31-0] = (xmm2/m128 » 0mm8[1-O] * 32) )[31-0] 
xmml [63-32] = (xmm2/m128 » (immS[3-2] * 32) )[31-0] 
xmml [95-64] = (xrnm2/m128 » (imm8[5-43 * 32) )[3l-0] 
xmml [127-96] = (xmm2/m128 » (immS[7-6] * 32) )[31-0] 

Protected Mode Exceptions 

#GP(0) For an illegal memory operand effective address in the CS, DS, ES, FS or 

GS segments- 

If memory operand is not aligned on a 16-byte boundary, regardless of 
segment. 

#SS(0) For an illegal address in the SS segment, 

#PF(fauit-code) For a page fault 
#NM If TS bit in CRO is set. 

#UD IfCR0,EM=l. 

If CR4,OSFXSR(bit 9) ~ 0. 

If CPU1D.WN1(£DX bit 26) = 0. 

Real-Address Mode Exceptions 

#GP(0) If memory operand i* not aligned on a 16-byte boundary, regardless of 

segment. 

Interrupt 13 If any part of the operand lies outside the effecdve address space from 0 

to OFFFFH. 

#NM If TS bit in CRO is set. 



5-174 
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INTEL CORPORATION STREAMING SIMD EXTENSJONS 2 INSTRUCTION SET 



frb'D irCR0,BM= 1. 

lfCR4.0SFXSR(bit 9) = 0. 
Tf CPUlD,WN(l£DX bit 26) = 0, 

Virtu a 1-3 08 6 Mode Exceptions 

Same exceptions as in Real Address Mode 
#PF<fault-code) For a page fault. 

Numeric Exceptions 

None. 
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STREAMING SIMD EXTENSIONS 2 INSTRUCTION SET INTEL CORPORATION 



PSHUFHW— Packed Shuffle High Words 



Instruction Description 

PSHUFHW xmml , xmm2Arti2$ f imm8 Shuffle the high wortfs in xmm2/mGm128 based on the 

encoding in imrrig and store the result in xmml . 



Description 

Shuffles the four high words in xmm2/mt;ml2S in the order selected by im.m8 and stores the 
result in the high quadword of xmml . Bits 1 and 0 of imm8 encode the source for destination 
word 4 (xmml [79-64]), bits 3 and 2 encode for word 5, bits 5 and 4 encode for word 6, and bits 
7 and 6 encode for word 7 (xmml[l27-t 1 2])- Similarly, the two bit encoding represents which 
source word is to be used, e.g., a binary encoding of 10 indicates that source word 6 
(XMM2[1 11-96] or Mem[ll 1-96]) will be used. The low quadword of the destination register 
is written will) the low 64 bits of the source register. 



Operation 

if (source = m128){ 

xmml {79-641 = (m128 » (imm8[1-0] - 16) )[79-64] 
xmrnl [95-80] - (ml28 » (imm8[3-2] * 16) )[79-64] 
xmm1[1 1 1-661 = (m128 » (irnrn8[5-4j * 16) )[79-64] 
xmm1[127-112] = (ml28 » (imm8[7-6] * 16) )[79-64] 

xmm1[63-0] = m128[63-0I; 
} else { 

xmml [79-64] = <xmm2 » (imm8[1 -0] * 1 6) )[79-64] 
xmm1[95n80] = (xmm2 » (imm8[3-2] * 16) )J7&^4] 
xmm1[1 11-95] - (xmm2 » (imm8[^4] * 16) )[79-64] 
xmm1[127-112] « (xmrn2 » (irnm8[7-6] * 16) )[79-64] 



} 



xmm1[63-0] = xmm2[63-0]; 



Protected Mode Exceptions 

££P(0) For an illegal memory operand effective address in the CS, DS, ES, FS or 

GS segments. 

If memory operand is not aligned on a 16-byte boundary, regardless of 
segment 

#$S(0) For an illegal address in the SS segment 

#PF(fauit-code) For a page feult. 
£fc-M If TS bit in CRO is set. 

#UD If CRO.EM - I. 
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If CR4_OSFXSR(bit 9) = 0. 

If CPUID.WNKUDX bit 26) - 0. 

Real-Address Mode Exceptions 

#GP(0) If memory operand is not aligned on a, 16-byte boundary, regardless of 

segment. 

Interrupt 1 3 If anv part of the operand lies outside the effective address space from 0 

toOFFFFH. 

#NM If TS bit in CRO is set 

#UD IfCR0.EM= 1. 

IfCR4.0SFXSR(bu 9) = 0. 

If CPU1D.WN(EDX bit 26) = 0. 

Virtual-8086 Mode Exceptions 

Same exceptions as in Real Address Mode 
#PF(fauIt-code) for a page fault. 

Numeric Exceptions 

None. 
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STREAMING SIMD EXTENSIONS 2 INSTRUCTION SET 



INTEL CORPORATION 



PSHUFLW— Packed Shuffle Low Word 



Instruction 


Description 


PSHUFLW xmml , xmm2/m128. imm8 


Shuffle th© low words in xmm2/mem12S based on the 
encoding in imm8 and store in xmml. 



Description 

Shuffles the four low words in xram2/mcml28 in the order selected by imrnS and stores the 
result in the low quadword of xmml.Bits 1 and 0 of immS encode the source for destination 
word 0 (xmm ! [ 1 5-0]), bits 3 and 2 encode for word 1 , bits 5 and 4 encode for word 2, and bits 
7 and 6 encode for word 3 (xmin 1 [63-48]). Similarly, the two bit encoding represents which 
source word is to be used* e.g., an binary encoding of 10 indicates that source word 2 
(xmm2Ancml28[47-32)) will be used. The high quadword of the destination register is written 
with the high 64 bits of the source register. 



Operation 

if (source ==m128){ 

xmm1[15-0] = (m128 » (imm8[1-0] * 16) )[15-0] 
xmm1[3l-l6] = (m128 » (immSfS-2] * 16) )[15-0] 
xmm1[47-32] = (m128 » (irnm8[5-4j * 16) )[15-fJ] 
xmml [63-48] * <ml28 » (imm8[7-6] * 16) )[15-0] 

xmml [127-64]= ml 28[1 27-64]; 
} efse { 

xmm1[15-0] = (xmm2 » (immS[1-0] * 16) )[15-0] 
xmm1[31-16l = (xmm2 » (imm8[3-2] * 16) )[15-0] 
xmm1[47^] = (xmm2 » (imm8[5-4] * 16) )[154] 
xmm1[63-48] = (xmm2 » (imm8|7-6J * 16) )[15-0] 



} 



xmm1[l27-64l * xmm2[1 27-64]; 



Protected Mode Exceptions 



#GP(0) 



#SS(0) 

#Pf(fault-code) 



For an illegal memory operand effective address in the CS, E>S } ES ? FS or 
GS segments. 

If memory operand is not aligned on a 16-byte boundary, regardless of 
segment. 

1-or an illegal address in the SS segment 
For a page fault. 
JfTS bit in CROisscL 
IfCR0JEM= 1. 
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INTEL CORPORATION STREAMING SIMD EXTENSIONS 2 INSTRUCTION SET 

If CR4X)SFXSR(bit 9) 0. 

If CPU1D.WNM(HDX bit 26) - 0. 

Real-Address Mode Exceptions 

#GP(0) If memory operand is not aligned on a 16-byte boundary, regardless of 

segment. 

Interrupt 13 If any part of the operand lies outside the effective address space from 0 

to OFFFFH. 

#NM If TS bit in CRO i& set. 

#UD IfCR0.EM=I. 

If CR4.0SFXSR (bit 9) - 0. 

If CPUTD. WN(EDX bit 26) = 0. 

Virtuai-8086 Mode Exceptions 

Same exceptions as in Real Address Mode 
#PF(faulr-code) For a page fault. 

Numeric Exceptions 

None. 
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